Search CORE

29 research outputs found

Detecting deceptive behaviour in the wild:text mining for online child protection in the presence of noisy and adversarial social media communications

Author: Peersman Claudia
Publication venue: Lancaster University
Publication date: 01/01/2018
Field of study

A real-life application of text mining research “in the wild”, i.e. in online social media, differs from more general applications in that its defining characteristics are both domain and process dependent. This gives rise to a number of challenges of which contemporary research has only scratched the surface. More specifically, a text mining approach applied in the wild typically has no control over the dataset size. Hence, the system has to be robust towards limited data availability, a variable number of samples across users and a highly skewed dataset. Additionally, the quality of the data cannot be guaranteed. As a result, the approach needs to be tolerant to a certain degree of linguistic noise. Finally, it has to be robust towards deceptive behaviour or adversaries. This thesis examines the viability of a text mining approach for supporting cybercrime investigations pertaining to online child protection. The main contributions of this dissertation are as follows. A systematic study of different aspects of methodological design of a state-ofthe- art text mining approach is presented to assess its scalability towards a large, imbalanced and linguistically noisy social media dataset. In this framework, three key automatic text categorisation tasks are examined, namely the feasibility to (i) identify a social network user’s age group and gender based on textual information found in only one single message; (ii) aggregate predictions on the message level to the user level without neglecting potential clues of deception and detect false user profiles on social networks and (iii) identify child sexual abuse media among thousands of legal other media, including adult pornography, based on their filename. Finally, a novel approach is presented that combines age group predictions with advanced text clustering techniques and unsupervised learning to identify online child sex offenders’ grooming behaviour. The methodology presented in this thesis was extensively discussed with law enforcement to assess its forensic readiness. Additionally, each component was evaluated on actual child sex offender data. Despite the challenging characteristics of these text types, the results show high degrees of accuracy for false profile detection, identifying grooming behaviour and child sexual abuse media identification

Lancaster E-Prints

Explore Bristol Research

Scamming the scammers:towards automatic detection of persuasion in advance fee frauds

Author: Edwards Matthew John
Peersman Claudia
Rashid Awais
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/04/2017
Field of study

Advance fee fraud is a significant component of online criminal activity. Fraudsters can often make off with significant sums, and victims will usually find themselves plagued by follow-up scams. Previous studies of how fraudsters persuade their victims have been limited to the initial solicitation emails sent to a broad population of email users. In this paper, we use the lens of scam-baiting – a vigilante activity whereby members of the public intentionally waste the time of fraudsters – to move beyond this first contact and examine the persuasive tactics employed by a fraudster once their victim has responded to a scam. We find linguistic patterns in scammer and baiter communications that suggest that the mode of persuasion used by scammers shifts over a conversation, and describe a corresponding stage model of scammer persuasion strategy. We design and evaluate a number of classifiers for identifying scam-baiting conversations amidst regular email, and for separating scammer from baiter messages based on their textual content, achieving high classification accuracy for both tasks. This forms a crucial basis for automated intervention, with a tool for identifying victims and a model for understanding how they are currently being exploited

Lancaster E-Prints

Explore Bristol Research

A Rapid Evidence Assessment of Technical Tools for the Detection and Disruption of Child Sexual Abuse Media (CSAM) and CSAM Offenders in The ASEAN Region

Author: Barbrook John
Brennan Maggie
May-Chahal Corinne
Mei Peidong
Mills Emma
Peersman Claudia
Rashid Awais
Publication venue
Publication date: 01/03/2022
Field of study

Explore Bristol Research

Ethical and Social Challenges with developing Automated Methods to Detect and Warn potential victims of Mass-marketing Fraud (MMF)

Author: Edwards Matthew John
Levi Michael
Peersman Claudia
Rashid Awais
Sasse Angela
Sorell Tom
Stringhini Gianluca
Whitty Monica
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/04/2017
Field of study

Mass-marketing frauds (MMFs) are on the increase. Given the amount of monies lost and the psychological impact of MMFs there is an urgent need to develop new and effective methods to prevent more of these crimes. This paper reports the early planning of automated methods our interdisciplinary team are developing to prevent and detect MMF. Importantly, the paper presents the ethical and social constraints involved in such a model and suggests concerns others might also consider when developing automated systems

Online Research @ Cardiff

UCL Discovery

Warwick Research Archives Portal Repository

Lancaster E-Prints

Explore Bristol Research

ABOP, automatic optimization of patient information leaflets

Author: Cardey Seditor
Delaere IsabelleUGent000100939210802000108756975383250931FA57EEB6-F0ED-11E1-A9DE-61C894A0A6B4
Hoste VeroniqueLW228020002478890000-0002-0539-4630F93F00BE-F0ED-11E1-A9DE-61C894A0A6B4
Peersman Claudia
Van Vaerenbergh Leona
Velaerts PeterLW22802000691160220C88EA-F0EE-11E1-A9DE-61C894A0A6B4
Publication venue: Université de Franche-Comté
Publication date: 01/01/2009
Field of study

Ghent University Academic Bibliography

Archivsystem Ask23

iCOP:live forensics to reveal previously unknown criminal media on P2P networks

Author: Peersman Claudia
Schulze Christian
Rashid Awais
Brennan Margaret
Fischer Carl
Publication venue
Publication date: 01/01/2012
Field of study

The increasing levels of criminal media being shared in peer-to-peer (P2P) networks pose a significant challenge to law enforcement agencies. One of the main priorities for P2P investigators is to identify cases where a user is actively engaged in the production of child sexual abuse (CSA) media – they can be indicators of recent or on-going child abuse. Although a number of P2P monitoring tools exist to detect paedophile activity in such networks, they typically rely on hash value databases of known CSA media. As a result, these tools are not able to adequately triage the thousands of results they retrieve, nor can they identify new child abuse media that are being released on to a network. In this paper, we present a new intelligent forensics approach that incorporates the advantages of artificial intelligence and machine learning theory to automatically flag new/previously unseen CSA media to investigators. Additionally, the research was extensively discussed with law enforcement cybercrime specialists from different European countries and Interpol. The approach has been implemented into the iCOP toolkit, a software package that is designed to perform live forensic analysis on a P2P network environment. In addition, the system offers secondary features, such as showing on-line sharers of known CSA files and the ability to see other files shared by the same GUID or other IP addresses used by the same P2P client. Finally, our evaluation on real CSA case data shows high degrees of accuracy, while hands-on trials with law enforcement officers demonstrate the toolkit’s complementarity to extant investigative workflows

Elsevier - Publisher Connector

Crossref

Plymouth Electronic Archive and Research Library

Cork Open Research Archive

Lancaster E-Prints

Explore Bristol Research

The Geography of Online Dating Fraud

Author: Edwards Matthew
Peersman Claudia
Rashid Awais
Stringhini Gianluca
Suarez-Tangil Guillermo
Whitty Monica
Publication venue
Publication date: 24/05/2018
Field of study

This paper presents an analysis of online dating fraud’s geography. Working with real romance scammer dating profiles collected from both proxied and direct connections, we analyse geographic patterns in the targeting and distinct characteristics of dating fraud from different countries, revealing several strong markers indicative of particular national origins having distinctive approaches to romance scamming. We augment IP geolocation information with other evidence about the dating profiles. By analysing the resource overlap between scam profiles, we discover that up to 11% of profiles created from proxied connections could be assigned a different national origin on the basis of text or images shared with profiles from direct connections. Our methods allow for improved understanding of the origins of dating fraud, beyond only direct geolocation of IP addresses, with patterns and resource sharing revealing approximate location information which could be used to target prevention campaigns

UCL Discovery

King's Research Portal

Explore Bristol Research

Scoping the Cyber Security Body of Knowledge

Author: Chivers Howard
Danezis George
Lewis Makayla
Lupu Emil
Martin Andrew
Peersman Claudia
Rashid Awais
Publication venue
Publication date: 01/05/2018
Field of study

Cybersecurity is becoming an important element in curricula at all education levels. However, the foundational knowledge on which the field of cybersecurity is being developed is fragmented, and as a result, it can be difficult for both students and educators to map coherent paths of progression through the subject. The Cyber Security Body of Knowledge (CyBOK) project (www.cybok.org) aims to codify the foundational and generally recognized knowledge on cybersecurity.</p

UCL Discovery

Oxford University Research Archive

Kingston University Research Repository

Explore Bristol Research